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PROTEINS EXPRESSED BY MYCOBACTERIUM TUBERCULOSIS AND NOT 
BY BCG AND THEIR USE AS DIAGNOSTIC REAGENTS AND VACCINES 

The invention is in the field of tuberculosis and, 
specifically, reagents useful for generating immune 
5 responses to Mycobacterium tuberculosis and for 

diagnosing infection and disease in a subject that has 
been exposed to M. tuberculosis . 

Background of the Invention 

10 Tuberculosis infection continues to be a world- 

wide health problem. This situation has recently been 
greatly exacerbated by the emergence of multi-drug 
resistant strains of M. tuberculosis and the 
international AIDS epidemic. It has thus become 

15 increasingly important that effective vaccines against 
and reliable diagnostic reagents for M. tuberculosis be 
produced. 

U.S. application no. 08/796,792 is incorporated 
herein by reference in it entirety. 

20 Summary of the Invention 

The invention is based on the inventor f s discovery 
that a polypeptide encoded by an open reading frame (ORE) 
in the genome of M. tuberculosis that is absent from the 
genome of the Bacille Calmette Guerin (BCG) strain of M. 

25 bovis elicited a delayed-type hypersensitivity response 
in animals infected with M. tuberculosis but not in 
animals sensitized with BCG. Thus proteins encoded by 
ORFs present in the genome of M. tuberculosis but absent 
from the genome of BCG represent reagents that are useful 

30 in discriminating between M. tuberculosis and BCG and, 
in particular, for diagnostic methods (e.g., skin tests 
and in vitro assays for M. tuberculosis-specific 
antibodies and lymphocyte responsiveness) which 
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discriminate between exposure of a subject to M. 
tuberculosis and vaccination with BCG. The invention 
features these polypeptides, functional segments thereof, 
DNA molecules encoding either the polypeptides or the 
5 functional segments, vectors containing the DNA 

molecules, cells transformed by the vectors, compositions 
containing one or more of any of the above polypeptides, 
functional segments, or DNA molecules, and a variety of 
diagnostic, therapeutic, and prophylactic (vaccine) 

10 methodologies utilizing the foregoing. 

Specifically, the invention features an isolated 
DNA molecule containing a DNA sequence encoding a 
polypeptide with a first amino acid sequence that can be 
the amino acid sequence of the polypeptide MTBN1, MTBN2, 

15 MTBN3, MTBN4, MTBN5, MTBN6, MTBN7 or MTBN8, as depicted 
in Fig. 1, or a second amino acid sequence identical to 
the first amino acid sequence with conservative 
substitutions; the polypeptide has Mycobacterium 
tuberculosis specific antigenic and immunogenic 

20 properties. Also included in the invention is an isolated 
portion of the above DNA molecule. The portion of the 
DNA molecule encodes a segment of the polypeptide shorter 
than the full-length polypeptide, and the segment has 
Mycobacterium tuberculosis specific antigenic and 

25 immunogenic properties. Other embodiments of the 

invention are vectors containing the above DNA molecules 
and transcriptional and translational regulatory 
sequences operationally linked to the DNA sequence; the 
regulatory sequences allow for expression of the 

30 polypeptide or functional segment encoded by the DNA 
sequence in a cell. The invention encompasses cells 
(e.g., eukaryotic and prokaryotic cells) transformed with 
the above vectors. 

The invention encompasses compositions containing 

35 any of the above vectors and a pharmaceutical^ 

acceptable diluent or filler. Other compositions (to be 
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used, for example, as DNA vaccines) can contain at least 
two (e.g., three, four, five, six, seven, eight, nine, 
ten, twelve, fifteen, or twenty) DNA sequences, each 
encoding a polypeptide of the Mycobacterium tuberculosis 
5 complex or a functional segment thereof, with the DNA 
sequences being operationally linked to transcriptional 
and translational regulatory sequences which allow for 
expression of each of the polypeptides in a cell of a 
vertebrate. In such compositions, at least one (e.g., 

10 two, three, four, five, six, seven, or eight) of the DNA 
sequences is one of the above DNA molecules of the 
invention. The encoded polypeptides will preferably be 
those not encoded by the genome of cells of the BCG 
strain of M. bovis. 

15 The invention also features an isolated 

polypeptide with a first amino acid sequence that can be 
rhe sequence of the polypeptide MTBN1, MTBN2 , MTBN3, 
MTBN4, MTBN5, MTBN6, MTBN7 or MTBN8 as depicted in Fig. 
1, or a second amino acid sequence identical to the first 

20 amino acid sequence with conservative substitutions. The 
polypeptide has Mycobacterium tuberculosis specific 
antigenic and immunogenic properties. Also included in 
the invention is an isolated segment of this polypeptide, 
the segment being shorter than the full-length 

25 polypeptide and having Mycobacterium tuberculosis 

specific antigenic and immunogenic properties. Other 
embodiments are compositions containing the polypeptide, 
or functional segment, and a pharmaceutically acceptable 
diluent or filler. Compositions of the invention can 

30 also contain at least two (e.g., three, four, five, six, 
seven, eight, nine, ten, twelve, fifteen, or twenty) 
polypeptides of the Mycobacterium tuberculosis complex, 
or functional segments thereof, with at least one of the 
at least two (e.g., two, three, four, five, six, seven, 

35 or eight) polypeptides having the sequence of one of the 
above described polypeptides of the invention. The 
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polypeptides will preferably be those not encoded by the 
genome of cells of the BCG strain of M. bovis. 

The invention also features methods of diagnosis • 
One embodiment is a method involving: (a) administration 
5 of one of the above polypeptide compositions to a subject 
suspected of having or being susceptible to Mycobacterium 
tuberculosis infection; and (b) detecting an immune 
response in the subject to the composition, as an 
indication that the subject has or is susceptible to 
10 Mycobacterium tuberculosis infection. An example of such 
a method is a skin test in which the test substance 
(e.g., compositions containing one or more of MTBN1- 
MTBN8) is injected intradermally into the subject and in 
which a skin delayed-type hypersensitivity response is 
15 tested for. Another embodiment is a method that 

involves: (a) providing a population of cells containing 
CD4 T lymphocytes from a subject; (b) providing a 
population of cells containing antigen presenting cells 
(APC) expressing a major histocompatibility complex (MHC) 
20 class II molecule expressed by the subject; (c) 

contacting the CD4 lymphocytes of (a) with the APC of (b) 
in the presence of one or more of the polypeptides, 
functional segments, and or polypeptide compositions of 
the invention; and (d) determining the ability of the CD4 
25 lymphocytes to respond to the polypeptide, as an 

indication that the subject has or is susceptible to 
Mycobacterium tuberculosis infection. Another diagnostic 
method of the invention involves: (a) contacting a 
polypeptide, a functional segment, or a 
30 polypeptide/functional segment composition of the 
invention with a bodily fluid of a subject; 
(b) detecting the presence of binding of antibody to the 
polypeptide, functional segment, or 
polypeptide/functional segment composition, as an 
35 indication that the subject has or is susceptible to 
Mycobacterium tuberculosis infection. 
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Also encompassed by the invention are methods of 
vaccination. These methods involve administration of any 
of the above polypeptides, functional segments, or DNA 
compositions to a subject. The compositions can be 
administered alone or with one or more of the other 
compositions . 

As used herein, an "isolated DNA molecule" is a 
DNA which is one or both of: not immediately contiguous 
with one or both of the coding sequences with which it is 
immediately contiguous (i.e., one at the 5' end and one 
at the 3» end) in the naturally-occurring genome of the 
organism from which the DNA is derived; or which is 
substant ialiy free of DNA sequence with which it occurs 
in the organism from which the DNA is derived. The term 
15 includes, for example, a recombinant DNA which 

incorporated into a vector, e.g., into an autonomously 
replicating plasmid or virus, or into the genomic DNA of 
a prokaryote or eukaryote, or which exists as a separate 
molecule (e.g., a cDNA or a genomic fragment produced by 
20 PCR or restriction endonuclease treatment) independent of 
other DNA sequences. Isolated DNA also includes a 
recombinant DNA which is part of a hybrid DNA encoding 
additional M. tuberculosis polypeptide sequences. 

"DNA molecules" include cDNA, genomic DNA, and 
25 synthetic (e.g., chemically synthesized) DNA . Where 

single-stranded, the DNA molecule may be a sense strand 
or an antisense strand. 

An "isolated polypeptide" of the invention is a 
polypeptide which either has no naturally-occurring 
30 counterpart, or has been separated or purified from 
components which naturally accompany it, e.g., in M. 
tuberculosis bacteria. Typically, the polypeptide is 
considered "isolated" when it is at least 70%, by dry 
weight, free from the proteins and naturally-occurring 
35 organic molecules with which it is naturally associated. 
Preferably, a preparation of a polypeptide of the 
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invention is at least 80%, more preferably at least 90%, 
and most preferably at least 99%, by dry weight, the 
peptide of the invention. Since a polypeptide that is 
chemically synthesized is, by its nature, separated from 
5 the components that naturally accompany it, the synthetic 
polypeptide is "isolated." 

An isolated polypeptide of the invention can be 
obtained, for example, by extraction from a natural 
source (e.g., M. tuberculosis bacteria); by expression of 
10 a recombinant nucleic acid encoding the polypeptide; or 

by chemical synthesis. A polypeptide that is produced in 
a cellular system different from the source from which it 
naturally originates is "isolated, " because it will be 
j*»5 separated from components which naturally accompany it. 

D 15 The extent of isolation or purity can be measured by any 

appropriate method, e.g., column chromatography, 
§7j polyacrylamide gel electrophoresis, or HPLC analysis. 

C3 The polypeptides may contain a primary amino acid 

¥s? sequence that has been modified from those disclosed 

f=s 20 herein. Preferably these modifications consist of 

■=SSf 

yi conservative amino acid substitutions. Conservative 

*t substitutions typically include substitutions within the 

□ following groups: glycine and alanine; valine, 

vi isoleucine, and leucine; aspartic acid and glutamic acid; 

2 5 asparagine and glutamine; serine and threonine; lysine 
and arginine; and phenylalanine and tyrosine. 

The terms "protein" and "polypeptide" are used 
herein to describe any chain of amino acids, regardless 
of length or post-translational modification (for 
30 example, glycosylation or phosphorylation) . Thus, the 
term "Mycobacterium tuberculosis polypeptide" includes 
full-length, naturally occurring Mycobacterium 
tuberculosis protein, as well a recombinantly or 
synthetically produced polypeptide that corresponds to a 
35 full-length naturally occurring Mycobacterium 

tuberculosis protein or to particular domains or portions 
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of a naturally occurring protein. The term also 
encompasses a mature Mycobacterium tuberculosis 
polypeptide which has an added amino-terminal methionine 
(useful for expression in prokaryotic cells) or any short 
5 amino acid sequences useful for protein purification by 
affinity chromatography, e.g., polyhistidine for 
purification by metal chelate chromatography. 

As used herein, "immunogenic" means capable of 
activating a primary or memory immune response. Immune 

10 responses include responses of CD4 + and CD8+ T 

lymphocytes and B-lymphocytes . In the case of T 
lymphocytes, such responses can be proliferative, and/or 
cytokine (e.g., interleukin ( IL) -2 , IL-3, IL-4, IL-5, IL- 
6, IL-12, IL-13, IL-15, tumor necrosis factor-a (TNF-a) , 

15 or interferon-y (IFN-y) ) -producing, or they can result in 
generation of cytotoxic T-lymphocytes (CTL) . B- 
lymphocyte responses can be those resulting in antibody 
production by the responding B lymphocytes. 

As used herein, "antigenic" means capable of being 

20 recognized by either antibody molecules or antigen- 
specific T cell receptors (TCR) on activated effector T 
cells (e.g., cytokine-producing T cells or CTL). 

Thus, polypeptides that have "Mycobacterium 
tuberculosis specific antigenic properties" are 

25 polypeptides that: (a) can be recognized by and bind to 
antibodies elicited in response to Mycobacterium 
tuberculosis organisms or wild-type Mycobacterium 
tuberculosis molecules (e.g., polypeptides); or (b) 
contain subsequences which, subsequent to processing of 

30 the polypeptide by appropriate antigen presenting cells 
(APC) and bound to appropriate major histocompatibility 
complex (MHC) molecules, are recognized by and bind to 
TCR on effector T cells elicited in response to 
Mycobacterium tuberculosis organisms or wild-type 

35 Mycobacterium tuberculosis molecules (e.g., 
polypeptides) . 
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As used herein, polypeptides that have 
"Mycobacterium tuberculosis specific immunogenic 
properties" are polypeptides that: (a) can elicit the 
production of antibodies that recognize and bind to 

5 Mycobacterium tuberculosis organisms or wild-type 
Mycobacterium tuberculosis molecules (e.g., 
polypeptides); or (b) contain subsequences which, 
subsequent to processing of the polypeptide by 
appropriate antigen presenting cells (APC) and bound to 

0 appropriate major histocompatibility complex (MHC) 

molecules on the surface of the APC, activate T cells 
with TCR that recognize and bind to peptide fragments 
derived by processing by APC of Mycobacterium 
tuberculosis organisms or wild-type Mycobacterium 

5 tuberculosis molecules (e.g., polypeptides) and bound to 
MHC molecules on the surface of the APC. The immune 
responses elicited in response to the immunogenic 
polypeptides are preferably protective. As used herein, 
"protective" means preventing establishment of an 

0 infection or onset of a disease or lessening the severity 
of a disease existing in a subject. "Preventing" can 
include delaying onset, as well as partially or 
completely blocking progress of the disease. 

As used herein, a "functional segment of a 

5 Mycobacterium tuberculosis polypeptide" is a segment of 
the polypeptide that has Mycobacterium tuberculosis 
specific antigenic and immunogenic properties. 

Where a polypeptide, functional segment of a 
polypeptide, or a mixture of polypeptides and/or 

0 functional segments have been administered (e.g., by 
intradermal injection) to a subject for the purpose of 
testing for a M. tuberculosis infection or susceptibility 
to such an infection, "detecting an immune response" 
means examining the subject for signs of a immunological 

5 reaction to the administered material, e.g., reddening or 
swelling of the skin at the site of an intradermal 
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1 : 



10 



injection. Where the subject has antibodies to the 
administered material, the response will generally be 
rapid, e.g., 1 minute to 2 4 hours. On the other hand, a 
memory or activated T cell reaction of pre-immunized T 
lymphocytes in the subject is generally slower, appearing 
only after 24 hours and being maximal at 24-96 hours. 

As used herein, a "subject" can be a human subject 
or a non-human mammal such as a non-human primate, a 
horse, a bovine animal, a pig, a sheep, a goat, a dog, a 
cat, a rabbit, a guinea pig, a hamster, a rat, or a 
mouse . 

Unless otherwise defined, all technical and 
scientific terms used herein have the same meaning as 
commonly understood by one of ordinary skill in the art 

15 to which this invention pertains. In case of conflict, 
the present document, including definitions, will 
control. Preferred methods and materials are described 
below, although methods and materials similar or 
equivalent to those described herein can be used in the 

20 practice or testing of the present invention. Unless 
otherwise indicated, these materials and methods are 
illustrative only and are not intended to be limiting. 
All publications, patent applications, patents and other 
references mentioned herein are illustrative only and not 

25 intended to be limiting. 

Other features and advantages of the invention, 
e.g., methods of diagnosing M. tuberculosis infection, 
will be apparent from the following description, from the 
drawings and from the claims. 

30 Brief Description of the Drawings 

Figure 1 is a depiction of the amino acid 
sequences of M. tuberculosis polypeptides MTBN1-MTBN8 . 

Figure 2 is a depiction of the nucleotide 
sequences of the coding regions (mtbnl-mtbn8 ) encoding 
35 MTBN1-MTBN8. 
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Figure 3 is a bar graph showing the delayed-type 
hypersensitivity responses induced by intradermal 
injection of 3 different test reagents in female guinea 
pigs that had been either infected with M. tuberculosis 
cells or sensitized with BCG or M. avium cells. 

Detailed Description 
The genome of M. tuberculosis [Cole et al . (1998) 
Nature 393 : 537-544] contains open reading frames (ORFs) 
that have been deleted from the avirulent BCG strain. 
The polypeptides encoded by these ORFs are designated 
herein "M. tuberculosis BCG Negative" polypeptides 
( "MTBN" ) and the ORFs are designated "mtbn." The 
invention is based on the discovery that a MTBN 
polypeptide (MTBN 4 ) elicited a skin response in animals 
15 infected with M. tuberculosis, but not in animals 
sensitized to either BCG or M. avium, a non-M. 
tuberculosis-complex strain of mycobacteria (see Example 
1 below). These findings indicate that MTBN (e.g., 
MTBN1-MTBN8) can be used in diagnostic tests that 
20 discriminate infection of a subject by M. tuberculosis 
from exposure to both mycobacteria other than the M. 
tuberculosis-complex and BCG. The M. tuberculosis- 
complex includes M. tuberculosis, M. bovis, M. microti, 
and M. africanum. Thus they can be used to discriminate 
25 subjects exposed to M. tuberculosis, and thus potentially 
having or being in danger of having tuberculosis, from 
subjects that have been vaccinated with BCG, the most 
widely used tuberculosis vaccine. Diagnostic assays that 
are capable of such discrimination represent a major 
advance that will greatly reduce wasted effort and 
consequent costs resulting from further diagnostic tests 
and/or therapeutic procedures in subjects that have given 
positive results in less discriminatory diagnostic tests. 
Furthermore, the results in Example 1 show that MTBN4, as 
35 expressed by whole viable M. tuberculosis organisms, is 
capable of inducing a strong immune response in subjects 
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infected with the organisms and thus has the potential to 
be a vaccine. 

The MTBN polypeptides of the invention include, 
for example, polypeptides encoded within the RD1, RD2, 
5 and RD3 regions of the M. tuberculosis genome [Mahairas 
et al. (1996) J. Bacterid. 178:1274-1282]. Of particular 
interest are polypeptides encoded by ORFs within the RD1 
region of the M. tuberculosis genome. However, the 
invention is not restricted to the RD1, RD2, and RD3 

10 region encoded polypeptides and includes any polypeptides 
encoded by ORFs contained in the genome of one or more 
members of the M. tuberculosis genome and not contained 
in the genome of BCG . The amino acid sequences of MTBN1- 
MTBN8 are shown in Fig. 1 and the nucleotide sequences of 

15 mtbnl-mtbn8 are shown in Fig. 2. 

The invention encompasses: (a) isolated DNA 
molecules containing mtbn sequences (e.g., mtbnl-mtbn8) 
encoding MTBN polypeptides (e.g., MTBN1-MTBN8) and 
isolated portions of such DNA molecules that encode 

20 polypeptide segments having antigenic and immunogenic 
properties (i.e., functional segments); (b) the MTBN 
polypeptides themselves (e.g., MTBN1-MTBN8) and 
functional segments of them; (c) antibodies (including 
antigen binding fragments, e.g., F(ab') 2 , Fab, Fv, and 

25 single chain Fv fragments of such antibodies) that bind 
to the MTBN polypeptides (e.g., MTBN1-MTBN8) and 
functional segments; (d) nucleic acid molecules (e.g., 
vectors) containing and capable of expressing one or more 
of the mtbn (e.g., mtbnl-mtbn8) sequences and portions of 

30 DNA molecules; (e) cells (e.g., bacterial, yeast, insect, 
or mammalian cells) transformed by such vectors; 
(f) compositions containing vectors encoding one or more 
M. tuberculosis polypeptides (or functional segments) 
including both the MTBN (e.g., MTBN1-MTBN8 ) polypeptides 

35 (or functional segments thereof) and previously described 
AT. tuberculosis polypeptides such as ESAT-6, 14 kDa 



-11- 



WO 00/66157 



PCT/US00/12257 



antigen, MPT63 , 19 kDa antigen, MPT64 , MPT51, MTC28, 38 
kDa antigen, 45/47 kDa antigen, MPB70, Ag85 complex, 
MPT53, and KatG (see also U.S. application no. 
08/796,792); (g) compositions containing one or more M. 
5 tuberculosis polypeptides (or functional segments), 
including both the polypeptides of the invention and 
previously described M. tuberculosis polypeptides such as 
those described above; (h) compositions containing one or 
more of the antibodies described in (c) ; (i) methods of 
10 diagnosis involving either (1) administration (e.g., 
intradermal injection) of any of the above polypeptide 
compositions to a subject suspected of having or being 
susceptible to M. tuberculosis infection, (2) in vitro 
testing of lymphocytes (B-lymphocytes , CD4 T lymphocytes, 

15 and CD8 T lymphocytes) from such a subject for 

responsiveness (e.g., by measuring cell proliferation, 
antibody production, cytokine production, or CTL 
activity) to any of the above polypeptide compositions, 
(3) testing of a bodily fluid (e.g., blood, saliva, 

20 plasma, serum, urine, or semen or a lavage such as a 
bronchoalveolar lavage, a vaginal lavage, or lower 
gastrointestinal lavage) for anribodies to the MTBN 
polypeptides (e.g., MTBN1-MTBN8) or functional segments 
thereof, or the above-described polypeptide compositions ; 

25 (4) testing of a bodily fluid (e.g., as above) for the 
presence of M. tuberculosis, MTBN (e.g., MTBN1-MTBN8) 
polypeptides or functional segments thereof, or the 
above-described polypeptide compositions in assays using 
the antibodies described in (c) ; and (5) testing of a 

30 tissue (e.g., lung or bronchial tissue) or a body fluid 
(e.g., as above) for the presence of nucleic acid 
molecules (e.g., DNA or RNA) encoding MTBN polypeptides 
(e.g., MTBN 1 -MTBN 8 ) (or portions of such a nucleic acid 
molecules) using nucleic acid probes or primers having 

35 nucleotide sequences of the nucleic molecules, portions 
of the nucleic molecules, or the complements of such 



-12- 



WO 00/66157 



PCT/US00/12257 



molecules; and (j) methods of vaccination involving 
administration to a subject of the compositions of either 
(f ) t Cg) f (h) or a combination of any two or even all 3 
compositions . 

5 With respect to diagnosis, purified MTBN proteins, 

functional segments of such proteins, or mixtures of 
proteins and/or the functional fragments have the above- 
described advantages of discriminating infection by M. 
tuberculosis from either infection by other bacteria, and 

10 in particular, non-pathogenic mycobacteria, or from 
exposure (by, for example, vaccination) to BCG. 
Furthermore, compositions containing the proteins, 
functional segments of the proteins, or mixtures of the 
proteins and/or the functional segments allows for 

15 improved quality control since "batch-to-batch" 

variability is greatly reduced in comparison to complex 
mixtures such as purified protein derivative (PPD) of 
tuberculin. 

The use of the above-described polypeptide and 

20 nucleic acid reagents for vaccination also provides for 
highly specific and effective immunization. Since the 
virulent M. tuberculosis polypeptides encoded by genes 
absent from avirulent BCG are likely to be mediators of 
virulence, immunity directed to them can be especially 

25 potent in terms of protective capacity. Where 

vaccination is performed with nucleic acids both In vivo 
and ex vivo methods can be used. In vivo methods involve 
administration of the nucleic acids themselves to the 
subject and ex vivo methods involve obtaining ceils 

30 (e.g., bone marrow cells or fibroblasts) from the 

subject, transducing the cells with the nucleic acids, 
preferably selecting or enriching for successfully 
transduced cells, and administering the transduced cells 
to the subject. Alternatively, the cells that are 

35 transduced and administered to the subject can be derived 
from another subject. Methods of vaccination and 
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diagnosis are described in greater detail in U.S. 
application no. 08/796,792 which is incorporated herein 
by reference in its entirety. 

The following example is meant to illustrate, not 
5 limit the invention. 

Example 1. MPBN4 Elicits a Specific Skin Reaction in 
Guinea Pigs Infected with M. tuberculosis 

Four groups of outbred female guinea pigs (18 per 
group) were used to test the usefulness of the MTBN4 
10 polypeptide as a M. tuberculosis-specific diagnostic 
reagent. The four groups were treated as follows. 

Group 1 animals were infected by aerosol with 
approximately 100 M. tuberculosis strain H37Rv cells. 
Group 2 animals were sensitized intradermally with 10 6 
15 live M. bovis BCG Japanese cells. 

Group 3 animals were sensitized intradermally with 10 6 
live M, avium cells. 

Group 4 animals were mock-sensitized by intradermal 
injection with saline. 

20 Seven weeks after infection or sensitization, the 

animals were injected intradermally with 1 yg of PPD (6 
animals from each group) , 2 yg of purified recombinant 
MPT64 (6 animals from each group), or 2 yg of MTBN4 {6 
animals from each group) . The diameter of the resulting 
25 erythema was measured 24 hours later. Data are expressed 
as mean diameter of erythema (in mm) and standard 
deviations are indicated (Fig. 3) . 

No erythema was detected in the group 4 animals 
with any test substance and thus no data are shown for 
this group. On the other hand, group 1 animals (solid 
bars) showed a significant response with all three test 
substances. Group 2 animals (open bars) showed a 
significant response to PPD and MPT64 but not MTBN4 . 



30 
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Group 3 animals showed a significant response to PPD only 
(hatched bars) . 

Thus, PPD which contains antigenic/immunogenic 
molecules common to the M. tuberculosis-complex as well 

5 as other mycobacterial strains, gave the least 

discriminatory results in that it induced responses in 
animals infected with or sensitized to mycobacteria of 
the M . tuberculosis-complex (M. tuberculosis and BCG) as 
well as another non-pathogenic mycobacterium (M. avium) . 

0 While MPT64, which is encoded and expressed by both M. 
tuberculosis and BCG, did not elicit a response in 
animals infected with M. avium, it did elicit responses 
in both the M. tuberculosis infected and the BCG 
sensitized animals. Finally, MTBN4 elicited a response 

5 in only the M. tuberculosis animals. Thus it induced the 
most specific response and, most importantly, allowed for 
discrimination between animals infected with M. 
tuberculosis and those sensitized to BCG . 

Although the invention has been described with 

0 reference to the presently preferred embodiment, it 

should be understood that various modifications can be 
made without departing from the spirit of the invention. 
Accordingly, the invention is limited only by the 
following claims. 
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What is claimed is: 

1 1. An isolated DNA molecule comprising a DNA 

2 sequence encoding a polypeptide with a first amino acid 

3 sequence selected from the group consisting of the amino 

4 acid sequences of the polypeptides MTBN1, MTBN2, MTBN3 , 

5 MTBN4 , MTBN5, MTBN6, MTBN7 , and MTBN8 , as depicted in 

6 Fig. 1, 

7 or a second amino acid sequence identical to said 

8 first amino acid sequence with conservative 

9 substitutions, 

10 wherein said polypeptide has Mycobacterium 

11 tuberculosis specific antigenic and immunogenic 

12 properties. 

1 2 . An isolated portion of the DNA molecule of 

2 claim 1, said portion encoding a segment of said 

3 polypeptide shorter than the full-length polypeptide, 

4 said segment having Mycobacterium tuberculosis specific 

5 antigenic and immunogenic properties. 

1 3. A vector comprising: 

2 (a) the DNA molecule of claim 1; and 

3 (b) transcriptional and translational regulatory 

4 sequences operationally linked to said DNA sequence, said 

5 regulatory sequences allowing for expression of the 

6 polypeptide encoded by said DNA sequence in a cell. 

1 4. A vector comprising: 

2 (a) the DNA molecule of claim 2; and 

3 (b) transcriptional and translational regulatory 

4 sequences operationally linked to said DNA sequence, said 

5 regulatory sequences allowing for expression of the 

6 polypeptide encoded by said DNA sequence in a cell. 

1 5. A cell transformed with the vector of claim 3. 

1 6. A cell transformed with the vector of claim 4. 
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2 7 . A composition comprising the vector of claim 3 

3 and a pharmaceutical^ acceptable diluent or filler. 

1 8. A composition comprising the vector of claim 4 

2 and a pharmaceutical^ acceptable diluent or filler. 

1 9. A composition comprising at least two DNA 

2 sequences, each encoding a polypeptide of the 

3 Mycobacterium tuberculosis complex that is not a 

4 polypeptide encoded by the genome of cells of the Bacille 

5 Calmette Guerin (BCG) strain of Mycobacteria bovls, said 

6 DNA sequences being operationally linked to 

7 transcriptional and translational regulatory sequences 

8 which allow for expression of each said polypeptide in a 

9 cell of a vertebrate, 

10 wherein at least one of said DNA sequences is a 

11 DNA molecule of claim 1. 

1 10. A composition comprising at least two DNA 

2 sequences, each encoding a functional fragment of a 

3 polypeptide of the Mycobacterium tuberculosis complex, 

4 said DNA sequences being operationally linked to 

5 transcriptional and translational regulatory sequences 

6 which allow for expression of each said polypeptide in a 

7 cell of a vertebrate, 

8 wherein at least one of said DNA sequences is a 

9 DNA molecule of claim 2. 

1 11. An isolated polypeptide with a first amino 

2 acid sequence selected from the group consisting of the 

3 sequences of the polypeptides MTBN1, MTBN2, MTBN3, MTBN4, 

4 MTBN5, MTBN6, MTBN7 , and MTBN8, as depicted in Fig. 1, 

5 or a second amino acid sequence identical to said 

6 firsr amino acid sequence with conservative 

7 substitutions, 

8 wherein said polypeptide has Mycobacterium 

9 tuberculosis specific antigenic and immunogenic 
10 properties. 
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1 12. An isolated segment of the polypeptide of 

2 claim 11, said segment being shorter than the full-length 

3 polypeptide and having Mycobacterium tuberculosis 

4 specific antigenic and immunogenic properties. 

1 13. A composition comprising the polypeptide of 

2 claim 11 and a pharmaceutically acceptable diluent or 

3 filler. 

1 14. A composition comprising a functional 

2 fragment of the polypeptide of claim 12 and a 

3 pharmaceutically acceptable diluent or filler. 

1 15. A composition comprising at least two 

2 polypeptides of the Mycobacterium tuberculosis complex, 

3 each polypeptide not being encoded by the genome of the 

4 cells of the BCG strain of Mycobacterium bovis, wherein 

5 at least one of said polypeptides is a polypeptide of 

6 claim 1 . 

1 16. A composition comprising functional fragments 

2 of at least two polypeptides of the Mycobacterium 

3 tuberculosis complex, each polypeptide not being encoded 

4 by the genome of cells of the Bacille Calmette Guerin 

5 (BCG) strain of Mycobacteria bovis, wherein at least one 

6 of said polypeptides is a segment of claim 2. 

1 17. A method of diagnosis comprising: 

2 (a) administration of the composition of claim 15 

3 to a subject suspected of having or being susceptible to 

4 Mycobacterium tuberculosis infection; and 

5 (b) detecting an immune response in said subject 

6 to said composition as an indication that said subject 

7 has or is susceptible to Mycobacterium tuberculosis 

8 infection. 
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18. A method of diagnosis comprising: 

(a) administration of the composition of claim 16 
to a subject suspected of having or being susceptible to 
Mycobacterium tuberculosis infection; and 

(b) detecting an immune response in said subject 
to said composition as an indication that said subject 
has or is susceptible to Mycobacterium tuberculosis 
infection . 

19. A method of diagnosis comprising: 

(a) providing a population of cells comprising CD4 
T lymphocytes from a subject; 

(b) providing a population of cells comprising 
antigen presenting cells (APC) expressing a major 
histocompatibility complex {MHC) class II molecule 
expressed by said subject; 

(c) contacting the CD4 lymphocytes of (a) with the 
APC of (b) in the presence of the polypeptide of claim 
12; and 

(d) determining the ability of said CD4 
lymphocytes to respond to said polypeptide, as an 
indication that said subject has or is susceptible to 
Mycobacterium tuberculosis infection. 

20. A method of diagnosis comprising: 

(a) providing a population of cells comprising CD4 
T lymphocytes from a subject; 

(b) providing a population of cells comprising 
antigen presenting cells (APC) expressing at least one 
major histocompatibility complex (MHC) class II molecule 
expressed by said subject; 

(c) contacting the CD4 lymphocytes of (a) with the 
APC of (b) in the presence of the segment of claim 12; 
and 

(d) determining the ability of said CD4 
lymphocytes to respond to said polypeptide, as an 



-19- 



WO 00/66157 



PCT/US00/12257 



13 indication that said subject has or is susceptible to 

14 Mycobacterium tuberculosis infection. 

1 21. A method of diagnosis comprising: 

2 (a) providing a population of cells comprising CD4 

3 T lymphocytes from a subject; 

4 (b) providing a population of cells comprising 

5 antigen presenting cells (APC) expressing at least one 

6 major histocompatibility complex (MHC) class II molecule 

7 expressed by said subject; 

8 (c) contacting the CD4 lymphocytes of (a) with the 

9 APC of (b) in the presence of the composition of claim 

10 15; and 

11 (d) determining the ability of said CD4 

12 lymphocytes to respond to said polypeptide, as an 

13 indication that said subject has or is susceptible to 

14 Mycobacterium tuberculosis infection. 

1 22. A method of diagnosis comprising: 

2 (a) providing a population of cells comprising CD4 

3 T lymphocytes from a subject ; 

4 (b) providing a population of ceils comprising 

5 antigen presenting cells (APC) expressing at least one 

6 major histocompatibility complex (MHC) class II molecule 

7 expressed by said subject; 

8 (c) contacting the CD4 lymphocytes of (a) with the 

9 APC of (b) in the presence of the composition of claim 

10 16; and 

11 (d) determining the ability of said CD4 

12 lymphocytes to respond to said polypeptide, as an 

13 indication that said subject has or is susceptible to 

14 Mycobacterium tuberculosis infection. 

1 23. A method of diagnosis comprising: 

2 (a) contacting the polypeptide of claim 11 with a 

3 bodily fluid of a subject; 
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4 (b) detecting the presence of binding of antibody 

5 to said polypeptide, as an indication that said subject 

6 has or is susceptible to Mycobacterium tuberculosis 
1 infection. 

1 2 4. A method of diagnosis comprising: 

2 (a) contacting the segment of claim 12 with a 

3 bodily fluid of a subject; 

4 (b) detecting the presence of binding of antibody 

5 to said polypeptide, as an indication that said subject 

6 has or is susceptible to Mycobacterium tuberculosis 

7 infection. 

1 25. A method of diagnosis comprising: 

2 (a) contacting the composition of claim 15 with a 

3 bodily fluid of a subject; 

4 (b) detecting the presence of binding of antibody 

5 to said composition, as an indication that said subject 

6 has or is susceptible to Mycobacterium tuberculosis 

7 infection. 

1 2 6. A method of diagnosis comprising: 

2 (a) contacting the composition of claim 16 with a 

3 bodily fluid of a subject ; 

4 (b) detecting the presence of binding of antibody 

5 to said composition, as an indication that ^aid subject 

6 has or is susceptible to Mycobacterium tuberculosis 

7 infection. 

1 27 . A method of vaccination comprising 

2 administration of the composition of claim 7 to a 

3 subject. 

1 28. A method of vaccination comprising 

2 administration of the composition of claim 8 to a 

3 subject. 
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1 29. A method of vaccination comprising 

2 administration of the composition of claim 9 to a 

3 subject. 

1 30. A method of vaccination comprising 

2 administration of the composition of claim 10 to a 

3 subject. 

1 31. A method of vaccination comprising 

2 administration of the composition of claim 13 to a 

3 subject. 

1 32. A method of vaccination comprising 

2 administration of the composition of claim 14 to a 

3 subject. 

1 33. A method of vaccination comprising 

2 administration of the composition of claim 15 to a 

3 subject. 

1 34 . A method of vaccination comprising 

2 administration of the composition of claim 16 to a 

3 subject. 



-22- 



WO 00/66157 



10/00 9383 

PCT/USOO/12257 



FIG. 1 



MTBN1 

MTAEPEVRTLREWLDQLGTAESRAYKMWLPPLTNPVPLNELIARDRRQPLRFALGIMDE 
PRRHLQDVWGVDVSGAGGNIGIGGAPQTGKSTLLQTMVMSAAATHSPRNVQFYCIDLGGG 
GLIYLENLPHVGGVANRSEPDKVNRWAEMQAVMRQRETTFKEHRVGSIGMYRQLRDDPS 
QPVASDPYGDVFLI IDGWPGFVGEFPDLEGQVQDLAAQGLAFGVHVI I STPRWTELKSRV 
RDYLGTKI EFRLGDVNETQIDRI TREI PANRPGRAVSMEKHHLMI GVPRFDGVHSADNLV 
EAITAGVTQIASQHTEQAPPVRVLPERIHLHELDPNPPGPESDYRTRWEIPIGLRETDLT 
PAHCHMHTNPHLLIFGAAKSGKTTIAHAIARAICARNSPQQVRFMLADYRSGLLDAVPDT 
HLLGAGAINRNSASLDEAVQALAVNLKKRLPPTDLTTAQLRSRSWWSGFDVVLLVDDWHM 
IVGAAGGMPPMAPLAPLLPAAADIGLHIIVTCQMSQAYKATMDKFVGAAFGSGAPTMFLS 
GEKQEFPS SEFKVKRRPPGQAFLVSPDGKEVI QAPYIEPPEEVFAAPPSAG* 



MTBN2 

MEKMSHDPIAADIGTQVSDNALHGVTAGSTALTSVTGLVPAGADEVSAQAATAFTSEGIQ 
LLASNASAQDQLHRAGEAVQDVARTYSQIDDGAAGVFAE* 



MTBN3 

MLWHAMPPELNTARLMAGAGPAPMLAAAAGWQTLSAALDAQAVELTARLNSLGEAWTGGG 

SDKALAAATPMWWLQTAS TQAKTRAMQATAQAAAYTQAMATTP SLPE IAANH I TQAVLT 

ATNFFGINTIPIALTEMDYFIRMWNQAALAMEVYQAETAVNTLFEKLEPMASILDPGASQ 

STTNPIFGMPSPGSSTPVGQLPPAATQTLGQLGEMSGPMQQLTQPLQQVTSLFSQVGGTG 

GGNPADEEAAQMGLLGTSPLSNHPLAGGSGPSAGAGLLRAESLPGAGGSLTRTPLMSQLI 

EKPVAPSVMPAAAAGSSATGGAAPVGAGAMGQGAQSGGSTRPGLVAPAPLAQEREEDDED 

DWDEEDDW* 



MTBN4 

MAEMKTDAATLAQEAGNFERISGDLKTQIDQVESTAGSLQGQWRGAAGTAAQAAVVRFQE 
AANKQKQELDEISTNIRQAGVQYSRADEEQQQALSSQMGF* 



MTBN5 

MAADYDKLFRPHEGMEAPDDMAAQPFFDPSASFPPAPASANLPKPNGQTPPPTSDDLSER 

FVSAPPPPPPPPPPPPPTPMPIAAGEPPSPEPAASKPPTPPMPIAGPEPAPPKPPTPPMP 

IAGPEPAPPKPPTPPMPIAGPAPTPTESQLAPPRPPTPQTPTGAPQQPESPAPHVPSHGP 

HQPRRTAPAPPWAKMPIGEPPPAPSRPSASPAEPPTRPAPQHSRRARRGHRYRTDTERNV 

GKVATGPS IQARLRAEEASGAQLAPGTEPS PAPLGQPRS YLAPPTRPAPTEPPPS PS PQR 

NSGRRAERRVHPDLAAQHAAAQPDSITAATTGGRRRKRAAPDLDATQKSLRPAAKGPKVK 

KVKPQKPKATKPPKWSQRGWRHWVHALTRINLGLSPDEKYELDLHARVRRNPRGSYQIA 

WGLKGGAGKTTLTAALGSTLAQVRADRILALDADPGAGNLADRVGRQSGATIADVLAEK 

ELSHYNDIRAHTSVNAVNLEVLPAPEYSSAQRALSDADWHFIADPASRFYNLVLADCGAG 

FFDPLTRGVLSTVSGWWASVSIDGAQQASVALDWLRNNGYQDLASRACWINHIMPGE 

PNVAVKDLVRHFEQQVQPGRVWMPWDRHIAAGTEISLDLLDPIYKRKVLELAAALSDDF 
ERAGRR* 
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MTBN6 

LSAPAVAAGPTAAGATAARPATTRVTILTGRRMTDLVLPAAVPMETYIDDTVAVLSEVLE 

DTPADVLGGFDFTAQGVWAFARPGSPPLKLDQSLDDAGVVDGSLLTLVSVSRTERYRPLV 

EDVIDAIAVLDESPEFDRTALNRFVGAAIPLLTAPVIGMAMRAWWETGRSLWWPLAIGIL 

GIAVLVGSFVANRFYQSGHLAECLLVTTYLLIATAAALAVPLPRGVNSLGAPQVAGAATA 

VLFLTIiMTRGGPRKRHEIASFAVITAIAVIAAAAAFGYGYQDWPAGGIAFGLFIVTNAA 

KLTVAVARIALPPIPVPGETVDNEELLDPVATPEATSEETPTWQAIIASVPASAVRLTER 

SKLAKQLLIGYVTSGTLILAAGAIAWVRGHFFVHSLVVAGLITTVCGFRSRLYAERWCA 

WALLAATVAI PTGLTAKLI I W YPH YAWL LL S V YL T VALVAL WVG S MAH VRR V S PVVKRT 

LELIDGAMIAAI I PMLLWITGVYDTVRNIRF* 

MTBN7 

MAEPLAVDPTGLSAAAA.KLAGLVFPQPPAPIAVSGTDSWAAINETMPSIESLVSDGLPG 
VKAALTRTASNMNAAADVYAKTDQSLGTSLSQYAFGSSGEGLAGVASVGGQPSQATQLLS 
TPVSQVTTQLGETAAELAPRWATVPQLVQLAPHAVQMSQNASPIAQTISQTAQQAAQSA 
QGGSGPMPAQLASAEKPATEQAEPVHEVTNDDQGDQGDVQPAEVVAAARDEGAGASPGQQ 
PGGGVPAQAMDTGAGARPAA.SPLAAPVDPSTPAPSTTTTL* 

MTBN8 

MSITRPTGSYARQMLDPGGWVEADEDTFYDRAQEYSQVLQRVTDVLDTCRQQKGHVFEGG 
LW S GGAANAANGALGAN I NQLMTLQD YL AT VI TWHRH I AGL I E QAKS D I GNNVD GAQRE I 

DILENDPSLDADERHTAINSLVTATHGANVSLVAETAERVLESKNWKPPKNALEDLLQQK 

SPPPPDVPTLWPSPGTPGTPGTPITPGTPITPGTPITPIPGAPVTPITPTPGTPVTPVT 

PGKPVTPVTPVKPGTPGEPTPITPVTPPVAPATPATPATPVTPAPAPHPQPAPAPAPSPG 

PQPVTPATPGPSGPATPGTPGGEPAPHVKPAALAEQPGVPGQHAGGGTQSGPAHADESAA 

S VTPAAASGVPGARAAAAAPSGTAVGAGARS SVGTAAAS GAGS HAATGRAP VATSDKAAA 

PSTRAA.SARTAPPARPPSTDHIDKPDRSESADDGTPVSMIPVSAARAARDAATAAASARQ 

RGRGDALRLARR I AAALNAS DNNAGDYGFF W I TAVT TD G S I WAN S YGL A Y I PD GME L PN 

KVYLASADHAIPVDEIARCATYPVIAVQAWAA.FHDMTLRA.VIGTAEQLASSDPGVAKIVL 

EPDDIPESGKMTGRSRLEWDPSAAAQLADTTDQRLLDLLPPAPVDVNPPGDERHMLWFE 

LMKPMTSTATGREAAHLRAFRAYAAHSQEIALHQAHTATDAAVQRVAVADWLYWQYVTGL 
LDRALAAAC* 
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FIG, 2 



mtbnl 

1 atgactgctg aaccggaagt acggacgctg cgcgaggttg tgctggacca 
51 gctcggcact gctgaatcgc gtgcgtacaa gatgtggctg ccgccgttga 
101 ccaatccggt cccgctcaac gagctcatcg cccgtgatcg gcgacaaccc 
.151 ctgcgatttg ccctggggat catggatgaa ccgcgccgcc atctacagga 
201 tgtgtggggc gtagacgttt ccggggccgg cggcaacatc ggtattgggg 
251 gcgcacctca aaccgggaag tcgacgctac tgcagacgat ggtgatgtcg 
301 gccgccgcca cacactcacc gcgcaacgtt cagttctatt gcatcgacct 
351 aggtggcggc gggctgatct atctcgaaaa ccttccacac gtcggtgggg 
401 tagccaatcg gtccgagccc gacaaggtca accgggtggt cgcagagatg 
451 caagccgtca tgcggcaacg ggaaaccacc ttcaaggaac accgagtggg 
501 ctcgatcggg atgtaccggc agctgcgtga cgatccaagt caacccgttg 
551 cgtccgatcc atacggcgac gtctttctga tcatcgacgg atggcccggt 
601 tttgtcggcg agttccccga ccttgagggg caggttcaag atctggccgc 
651 ccaggggctg gcgttcggcg tccacgtcat catctccacg ccacgctgga 
701 cagagctgaa gtcgcgtgtt cgcgactacc tcggcaccaa gatcgagttc 
751 cggcttggtg acgtcaatga aacccagatc gaccggatta cccgcgagat 
801 cccggcgaat cgtccgggtc gggcagtgtc gatggaaaag caccatctga 
851 tgatcggcgt gcccaggttc gacggcgtgc acagcgccga taacctggtg 
901 gaggcgatca ccgcgggggt gacgcagatc gcttcccagc acaccgaaca 
951 ggcacctccg gtgcgggt cc tgccggagcg tatccacctg cacgaactcg 
1001 acccgaaccc gccgggacca gagtccgact accgcactcg ctgggagatt 
1051 ccgatcggct tgcgcgagac ggacctgacg ccggctcact gccacatgca 
1101 cacgaacccg cacctactga tcttcggtgc ggccaaatcg ggcaagacga 
1151 ccattgccca cgcgatcgcg cgcgccattt gtgcccgaaa cagtccccag 
1201 caggtgcggt tcatgctcgc ggactaccgc tcgggcctgc tggacgcggt 
1251 gccggacacc catctgctgg gcgccggcgc gatcaaccgc aacagcgcgt 
1301 cgctagacga ggccgttcaa gcactggcgg tcaacctgaa gaagcggttg 
1351 ccgccgaccg acctgacgac ggcgcagcta cgctcgcgtt cgtggtggag 
1401 cggatttgac gtcgtgcttc tggtcgacga ttggcacatg atcgtgggtg 
1451 ccgccggggg gatgccgccg atggcaccgc tggccccgtt attgccggcg 
1501 gcggcagata tcgggttgca catcattgtc acctgtcaga tgagccaggc 
1551 ttacaaggca accatggaca agttcgtcgg cgccgcattc gggtcgggcg 
1601 ctccgacaat gttcctttcg ggcgagaagc aggaattccc atccagtgag 
1651 ttcaaggtca agcggcgccc ccctggccag gcatttctcg tctcgccaga 
1701 cggcaaagag gtcatccagg ccccctacat cgagcctcca gaagaagtgt 
1751 tcgcagcacc cccaagcgcc ggttaa 



mtbn2 

1 atggaaaaaa 

51 gagcgacaac 

101 cggtgaccgg 

151 gcgacggcgt 

201 ggcccaagac 

251 gcacctattc 



tgtcacatga 
gctctgcacg 
gctggttccc 
tcacatcgga 
cagctccacc 
gcaaatcgac 



tccgatcgct 
gcgtgacggc 
gcgggggccg 
gggcatccaa 
gtgcgggcga 
gacggcgccg 



gccgacattg 
cggctcgacg 
atgaggtctc 
ttgctggctt 
agcggtccag 
ccggcgtctt 



gcacgcaagt 
gcgctgacgt 
cgcccaagcg 
ccaatgcatc 
gacgtcgccc 
cgcctaatag 



mtbn3 

1^ atgctgtggc acgcaatgcc accggagcta aataccgcac ggctgatggc 
5i cggcgcgggt ccggctccaa tgcttgcggc ggccgcggga tggcagacgc 
101 tttcggcggc tctggacgct caggccgtcg agttgaccgc gcgcctgaac 
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151 tctctgggag aagcctggac tggaggtggc agcgacaagg cgcttgcggc 

201 tgcaacgccg atggtggtct ggctacaaac cgcgtcaaca caggccaaga 

251 cccgtgcgat gcaggcgacg gcgcaagccg cggcatacac ccaggccatg 

301 gccacgacgc cgtcgctgcc ggagatcgcc gccaaccaca tcacccaggc 

351 .. cgtccttacg gccaccaact tcttcggtat caacacgatc ccgatcgcgt 

401 tgaccgagat ggattatttc atccgtatgt ggaaccaggc agccctggca 

451 atggaggtct accaggccga gaccgcggtt aacacgcttt tcgagaagct 

501 cgagccgatg gcgtcgatcc ttgatcccgg cgcgagccag agcacgacga 

551 acccgatctt cggaatgccc tcccctggca gctcaacacc ggttggccag 

601 ttgccgccgg cggctaccca gaccctcggc caactgggtg agatgagcgg 

651 cccgatgcag cagctgaccc agccgctgca gcaggtgacg tcgttgttca 

701 gccaggtggg cggcaccggc ggcggcaacc cagccgacga ggaagccgcg 

751 cagatgggcc tgctcggcac cagtccgctg tcgaaccatc cgctggctgg 

801 tggatcaggc cccagcgcgg gcgcgggcct gctgcgcgcg gagtcgctac 

851 ctggcgcagg tgggtcgttg acccgcacgc cgctgatgtc tcagctgatc 

901 gaaaagccgg ttgccccctc ggtgatgccg gcggctgctg ccggatcgtc 

951 ggcgacgggt ggcgccgctc cggtgggtgc gggagcgatg ggccagggtg 

1001 cgcaatccgg cggctccacc aggccgggtc tggtcgcgcc ggcaccgctc 

1051 gcgcaggagc gtgaagaaga cgacgaggac gactgggacg aagaggacga 



mtbn4 

1 atggcagaga tgaagaccga tgccgctacc ctcgcgcagg aggcaggtaa 

51 tttcgagcgg atctccggcg acctgaaaac ccagatcgac caggtggagt 

101 cgacggcagg ttcgttgcag ggccagtggc gcggcgcggc ggggacggcc 

151 gcccaggccg cggtggtgcg cttccaagaa gcagccaata agcagaagca 

201 ggaactcgac gagatctcga cgaatattcg tcaggccggc gtccaatact 

251 cgagggccga cgaggagcag cagcaggcgc tgtcctcgca aatgggcttc 



mtbn.5 

1 atggcggccg actacgacaa gctcttccgg ccgcacgaag gtatggaagc 
51 tccggacgat atggcagcgc agccgttctt cgaccccagt gcttcgtttc 
101 cgccggcgcc cgcatcggca aacctaccga agcccaacgg ccagactccg 
151 cccccgacgt ccgacgacct gtcggagcgg ttcgtgtcgg ccccgccgcc 
201 gccaccccca cccccacctc cgcctccgcc aactccgatg ccgatcgccg 
251 caggagagcc gccctcgccg gaaccggccg catctaaacc acccacaccc 
301 cccatgccca tcgccggacc cgaaccggcc ccacccaaac cacccacacc 
351 ccccatgccc atcgccggac ccgaaccggc cccacccaaa ccacccacac 
401 ctccgatgcc catcgccgga cctgcaccca ccccaaccga atcccagttg 
451 gcgcccccca gaccaccgac accacaaacg ccaaccggag cgccgcagca 
501 accggaatca ccggcgcccc acgtaccctc gcacgggcca catcaacccc 
551 ggcgcaccgc accagcaccg ccctgggcaa agatgccaat cggcgaaccc 
601 ccgcccgctc cgtccagacc gtctgcgtcc ccggccgaac caccgacccg 
gcctgccccc caacactccc gacgtgcgcg ccggggtcac cgctatcgca 
cagacaccga acgaaacgtc gggaaggtag caactggtcc atccatccag 
gcgcggctgc gggcagagga agcatccggc gcgcagctcg cccccggaac 
ggagccctcg ccagcgccgt tgggccaacc gagatcgtat ctggctccgc 
851 ccacccgccc cgcgccgaca gaacctcccc ccagcccctc gccgcagcgc 
901 aactccggtc ggcgtgccga gcgacgcgtc caccccgatt tagccgccca 
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951 acatgccgcg gcgcaacctg attcaattac ggccgcaacc actggcggtc 
1001 gtcgccgcaa gcgtgcagcg ccggatctcg acgcgacaca gaaatcctta 
1051 aggccggcgg ccaaggggcc gaaggtgaag aaggtgaagc cccagaaacc 
1101 gaaggccacg aagccgccca aagtggtgtc gcagcgcggc tggcgacatt 
1151 gggtgcatgc gttgacgcga atcaacctgg gcctgtcacc cgacgagaag 
1201 tacgagctgg acctgcacgc tcgagtccgc cgcaatcccc gcgggtcgta 
1251 tcagatcgcc gtcgtcggtc tcaaaggtgg ggctggcaaa accacgctga 
1301 cagcagcgtt ggggtcgacg ttggctcagg tgcgggccga ccggatcctg 
1351 gctctagacg cggatccagg cgccggaaac ctcgccgatc gggtagggcg 
1401 acaatcgggc gcgaccatcg ctgatgtgct tgcagaaaaa gagctgtcgc 
1451 actacaacga catccgcgca cacactagcg tcaatgcggt caatctggaa 
1501 gtgctgccgg caccggaata cagctcggcg cagcgcgcgc tcagcgacgc 
1551 cgactggcat ttcatcgccg atcctgcgtc gaggttttac aacctcgtct 
1601 tggctgattg tggggccggc ttcttcgacc cgctgacccg cggcgtgctg 
1651 tccacggtgt ccggtgtcgt ggtcgtggca agtgtctcaa tcgacggcgc 
1701 acaacaggcg tcggtcgcgt tggactggtt gcgcaacaac ggttaccaag 
1751 atttggcgag ccgcgcatgc gtggtcatca atcacatcat gccgggagaa 
1801 cccaatgtcg cagttaaaga cctggtgcgg catttcgaac agcaagttca 
1851 acccggccgg gtcgtggtca tgccgtggga caggcacatt gcggccggaa 
1901 ccgagatttc actcgacttg ctcgacccta tctacaagcg caaggtcctc 
1951 gaattggccg cagcgctatc cgacgatttc gagagggctg gacgtcgttg 



mtbn6 

1 ttgagcgcac ctgctgttgc tgctggtcct accgccgcgg gggcaaccgc 
51 tgcgcggcct gccaccaccc gggtgacgat cctgaccggc agacggatga 
101 ccgatttggt actgccagcg gcggtgccga tggaaactta tattgacgac 
151 accgtcgcgg tgctttccga ggtgttggaa gacacgccgg ctgatgtact 
201 cggcggcttc gactttaccg cgcaaggcgt gtgggcgttc gctcgtcccg 
251 gatcgccgcc gctgaagctc gaccagtcac tcgatgacgc cggggtggtc 
301 gacgggtcac tgctgactct ggtgtcagtc agtcgcaccg agcgctaccg 
351 accgttggtc gaggatgtca tcgacgcgat cgccgtgctt gacgagtcac 
401 ctgagttcga ccgcacggca ttgaatcgct ttgtgggggc ggcgatcccg 
451 cttttgaccg cgcccgtcat cgggatggcg atgcgggcgt ggtgggaaac 
501 tgggcgtagc ttgtggtggc cgttggcgat tggcatcctg gggatcgctg 
551 tgctggtagg cagcttcgtc gcgaacaggt tctaccagag cggccacctg 
601 gccgagtgcc tactggtcac gacgtatctg ctgatcgcaa ccgccgcagc 
651 gctggccgtg ccgttgccgc gcggggtcaa ctcgttgggg gcgccacaag 
701 ttgccggcgc cgctacggcc gtgctgtttt tgaccttgat gacgcggggc 
751 ggccctcgga agcgtcatga gttggcgtcg tttgccgtga tcaccgctat 
801 cgcggtcatc gcggccgccg ctgccttcgg ctatggatac caggactgqg 
851 tccccgcggg ggggatcgca ttcgggctgt tcattgtgac gaatgcggcc 
901 aagctgaccg tcgcggtcgc gcggatcgcg ctgccgccga ttccggtacc 
951 cggcgaaacc gtggacaacg aggagttgct cgatcccgtc gcgaccccgg 
1001 aggctaccag cgaagaaacc ccgacctggc aggccatcat cgcgtcggtq 
1051 cccgcgtccg cggtccggct caccgagcgc agcaaactgg ccaagcaact 
rrrr tct Satcgga tacgtcacgt cgggcaccct gattctggct gccggtgcca 
Uti tc 9 c 99 tc g t ggtgcgcggg cacttctttg tacacagcct ggtggtcqcg 
1201 ggtttgatca cgaccgtctg cggatttcgc tcgcggcttt acgccgagcg 
ftg^Stgcg tgggcgttgc tggcggcgac ggtcgcgatt ccgacgggtc 
1301 tgacggccaa actcatcatc tggtacccgc actatgcctg gctgttgttg 
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1351 agcgtctacc tcacggtagc cctggttgcg ctcgtggtgg tcgggtcgat 
1401 ggctcacgtc cggcgcgttt caccggtcgt aaaacgaact ctggaattga 
1451 tcgacggcgc catgatcgct gccatcattc ccatgctgct gtggatcacc 
1501 ggggtgtacg acacggtccg caatatccgg ttctga 



mtbn7 

1 atggctgaac cgttggccgt cgatcccacc ggcttgagcg cagcggccgc 

51 gaaattggcc ggcctcgttt ttccgcagcc tccggcgccg atcgcggtca 

101 gcggaacgga ttcggtggta gcagcaatca acgagaccat gccaagcatc 

151 gaatcgctgg tcagtgacgg gctgcccggc gtgaaagccg ccctgactcg 

201 aacagcatcc aacatgaacg cggcggcgga cgtctatgcg aagaccgatc 

251 agtcactggg aaccagtttg agccagtatg cattcggctc gtcgggcgaa 

3 01 ggcctggctg gcgtcgcctc ggtcggtggt cagccaagtc aggctaccca 

351 gctgctgagc acacccgtgt cacaggtcac gacccagctc ggcgagacgg 

401 ccgctgagct ggcaccccgt gttgttgcga cggtgccgca actcgttcag 

451 ctggctccgc acgccgttca gatgtcgcaa aacgcatccc ccatcgctca 

501 gacgatcagt caaaccgccc aacaggccgc ccagagcgcg cagggcggca 

551 gcggcccaat gcccgcacag cttgccagcg ctgaaaaacc ggccaccgag 

601 caagcggagc cggtccacga agtgacaaac gacgatcagg gcgaccaggg 

651 cgacgtgcag ccggccgagg tcgttgccgc ggcacgtgac gaaggcgccg 

701 gcgcatcacc gggccagcag cccggcgggg gcgttcccgc gcaagccatg 

751 gataccggag ccggtgcccg cccagcggcg agtccgctgg cggcccccgt 

801 cgatccgtcg actccggcac cctcaacaac cacaacgttg tag 



mtbn8 

1 atgagtatta ccaggccgac gggcagctat gccagacaga tgctggatcc 
51 gggcggctgg gtggaagccg atgaagacac tttctatgac cgggcccagg 
101 aatatagcca ggttttgcaa agggtcaccg atgtattgga cacctgccgc 
151 cagcagaaag gccacgtctt cgaaggcggc ctatggtccg gcggcgccgc 

2 01 caatgctgcc aacggcgccc tgggtgcaaa catcaatcaa ttgatgacgc 
251 tgcaggatta tctcgccacg gtgattacct ggcacaggca tattgccggg 

3 01 ttgattgagc aagctaaatc cgatatcggc aataatgtgg atggcgctca 
351 acgggagatc gatatcctgg agaatgaccc tagcctggat gctgatgagc 
401 gccataccgc catcaattca ttggtcacgg cgacgcatgg ggccaatgtc 
451 agtctggtcg ccgagaccgc tgagcgggtg ctggaatcca agaattggaa 
501 acctccgaag aacgcactcg aggatttgct tcagcagaag tcgccgccac 
551 ccccagacgt gcctaccctg gtcgtgccat ccccgggcac accgggcaca 
601 ccgggaaccc cgatcacccc gggaaccccg' atcaccccgg gaaccccaat 
651 cacacccatc ccgggagcgc cggtaactcc gatcacacca acgcccggca 
701 ctcccgtcac gccggtgacc ccgggcaagc cggtcacccc ggtgaccccg 
751 gtcaaaccgg gcacaccagg cgagccaacc ccgatcacgc cggtcacccc 
801 cccggtcgcc ccggccacac cggcaacccc ggccacgccc gttaccccag 
851 ctcccgctcc acacccgcag ccggctccgg caccggcgcc atcgcctggg 
901 ccccagccgg ttacaccggc cactcccggt ccgtctggtc cagcaacacc 
951 gggcacccca gggggcgagc cggcgccgca cgtcaaaccc gcggcgttgg 
1001 cggagcaacc tggtgtgccg ggccagcatg cgggcggggg gacgcagtcg 
1051 gggcctgccc atgcggacga atccgccgcg tcggtgacgc cggctgcggc 
1101 gtccggtgtc ccgggcgcac gggcggcggc cgccgcgccg agcggtaccg 
1151 ccgtgggagc gggcgcgcgt tcgagcgtgg gtacggccgc ggcctcgggc 
1201 gcggggtcgc atgctgccac tgggcgggcg ccggtggcta cctcggacaa 
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1251 ggcggcggca ccgagcacgc gggcggcctc ggcgcggacg gcacctcctg 
1301 cccgcccgcc gtcgaccgat cacatcgaca aacccgatcg cagcgagtct 
1351 gcagatgacg gtacgccggt gtcgatgatc ccggtgtcgg cggctcgggc 
1401 ggcacgcgac gccgccactg cagctgccag cgcccgccag cgtggccgcg 
1451 gtgatgcgct gcggttggcg cgacgcatcg cggcggcgct caacgcgtcc 
1501 gacaacaacg cgggcgacta cgggttcttc tggatcaccg cggtgaccac 
1551 cgacggttcc atcgtcgtgg ccaacagcta tgggctggcc tacatacccg 
1601 acgggatgga attgccgaat aaggtgtact tggccagcgc ggatcacgca 
1651 atcccggttg acgaaattgc acgctgtgcc acctacccgg ttttggccgt 
1701 gcaagcctgg gcggctttcc acgacatgac gctgcgggcg gtgatcggta 
1751 ccgcggagca- gttggccagt tcggatcccg gtgtggccaa gattgtgctg 
1801 gagccagatg acattccgga gagcggcaaa atgacgggcc ggtcgcggct 
1851 ggaggtcgtc gacccctcgg cggcggctca gctggccgac actaccgatc 
1901 agcgtttgct cgacttgttg ccgccggcgc cggtggatgt caatccaccg 
1951 ggcgatgagc ggcacatgct gtggttcgag ctgatgaagc ccatgaccag 
2001 caccgctacc ggccgcgagg ccgctcatct gcgggcgttc cgggcctacg 
2051 ctgcccactc acaggagatt gccctgcacc aagcgcacac tgcgactgac 
2101 gcggccgtcc agcgtgtggc cgtcgcggac tggctgtact ggcaatacgt 
2151 caccgggttg ctcgaccggg ccctggccgc cgcatgctga 
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